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TITLE: FAST COMPUTATION OF MIMO DECISION FEEDBACK EQUALIZER 
COEFFICIENTS 

SPECIFICATION 

5 1. FIELD OF THE INVENTION 

This invention relates generally to digital communications; and more particularly to 
decision feedback based equalizers that are employed in digital communication systems. 
2. BACKGROUND OF THE INVENTION 

The structure and operation of communication systems is generally known. Many 

10 communication systems carry data, e.g., voice, audio, video, file, or other digital data that is sent 
from a transmitter to a receiver. On the transmitter side, data is first formed into packets. This 
data may be raw data or encoded data that represents the raw data. Each of these packets also 
typically includes a header, a known training sequence, and a tail. These packets are then 
modulated into symbols and the symbols are transmitted by the receiver and intended for receipt 

15 by the receiver. The receiver then receives the symbols and attempt to extract the data from the 
packets that are carried by the symbols. 

A "channel" carries the symbols from the transmitter to the receiver. The chaimel is 
serviced by a wired, wireless, optical, or another media, depending upon the conmiunication 
system type. In many communication systems, such as terrestrial based vdreless communication 

20 systems, satellite based communication systems, cable based conmiunication systems, etc., the 
channel distorts the transmitted symbols, from the perspective of the receiver, causing 
interference between a subject symbol and a plurality of symbols surrounding the subject 
symbol. This type of distortion is referred to as "mter-symbol-mterference" and is, generally 
speaking, the time-dispersed receipt of multiple copies the symbols caused by multipath. The 

25 channel also introduces noise into the symbols prior to their receipt Each of these concepts is 
well known. 

Equalizers are now generally employed in an attempt to remove chaimel effects from a 
received symbol stream. Thus, equalizers are essential building blocks of modem receivers, 
especially in broadband applications where mter-symbol-interference is a critical problem. In a 
30 typical equalizer, the channel between the transmitter and the receiver is first estimated based 
upon the training sequence contained in one or more preambles. Then optimal equalizer 
coefficients (also referred to as taps and/or tap coefficients for the equalizer) are estimated based 
upon the channel estimate. The optimal equalizer coefficients are then used by the equalizer in 
extracting the data from the packet. The optimal equalizer coefficients may also be computed 

1 
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after extraction of the data from the equalized data stream based upon blind channel estimates. 
Equalizer coefficient generation should be repeated as often as possible, especially in fast 
varying channel cases, to generate new equalizer coeflRcients. The received data stream is 
usually buffered during the period that is required for channel estimation and equalizer 
5 coefficient computations. Thus, the preamble (and also actual data) contained in a packet may 
be used to generate the channel estimate and optimal equalizer coefficients that are employed by 
the equalizer to extract the data from the packet. 

As symbol rates increase and modulation schemes become more complex, equalizers 
have increasingly greater importance. A critical factor in increasing the effectiveness of these 

1 0 equalizers is the complexity of optimal equalizer coefficient computatioa A reduction in this 
complexity: (1) reduces the memory size required to buffer the received symbol stream 
sequence during the period required for coefficient computations; (2) allows more frequent 
uploading of new coefficients thus enabling the equalizer to track fast channel variations; and (3) 
simplifies the hardware and, resultantly, the die area required for coefficient computation.FIG. 1 

15 is a block diagram illustrating a discrete time symbol-spaced Decision Feedback Equalizer 
(DFE) based channel equalization model 100. The channel equalization model 100 includes a 
channel 102, a Feed Forward Equalizer (FFE) 104, a Decision block 106, and a Feed Back 
Equalizer (FBE) 108. An input sequence jc(«) is complex, independent and identically 
distributed with xmit power. Additive noise v(w) is white Gaussian with power spectral density 

20 . Furthermore, the decisions xQt -S) die assumed to be correct, and hence equal to 

x(n - S) . This assumption makes the design of the FBE 108 and FFE 104 easier, but at tiie 
expense of introducing error propagation due to possibly wrong decisions. The FFE 1 04 
function G(z) has length L The channel (impulse) response vector of tiie channel h is given in 
Equation (1) as: 



Hie number of coefficients (taps) M of the FBE 108 function B(z) is assumed greater or equal to 
the channel memory, i.e., M ^ iV - 1 . These modeling assumptions are feasible in practice. 

In estimating FFE 104 and FBE 108 equalizer coefficients, the goal is to namimize the 
mean square error quantity of Equation (2). 



25 



Equation (1) 



30 



Equation (2) 
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where x(n-S) is the delayed mput signal estimate prior to the Decision block 106. By 
collecting the coefficients of both G(z) and B(z) into vectors, we can express the received signal 
i(w - ^) in Equation (3) as: 

x„^yng-x„b Equation (3) 

A channel ou^ut model defining y„ may be expressed by: 

y„ = x„H + v„ Equation (4) 

where J? is the {N + L-l)xL convolution matrix corresponding to. the channel response and 

expressed as: 



m 



0 



h{N-\) h{N^2) 
0 KN'-l) 



0 
0 

A(0) 

Ki) 



0 0 

10 In this model, x„ is the 1 x (iV + i - 1) mput vector, 

x„ = [jc(«) jc(n-l) x(«-JV'-i + 2)] 
j/^ is the 1 X i input regression vector to the FFE 104, 

x„ is the 1 X M input regression vector to the (strictly causal) FBE 108, 
15 jc„ = [jc(n-<y-l) Jc(«-<y-2) — xin-S-M)] 



Equation (5) 



Equation (6) 



Equation (7) 



Equation (8) 



and v„ is the 1 X Z vector noise process. 

The current efficient methods for computing the optimal filter coefficients of a decision 
feedback equalizer, which optimizes (1), are based on the well-known Cholesky decomposition 
20 method (from a finite-dimension problem formulation). Two published pf^^ers: (1) N. Al- 
Dhahir and JJVI. Cioffi, '*MMSE Decision-Feedback Equalizers: Finite-Length Results," IEEE 
Trans, on Information Theory, voL 41, no. 4, pp. 961-973, July 1995; and (2) N. AI-Dhahir and 
J,M. Cioffi, "Fast Computation of Channel-Estimate Based Equalizers in Packet Data 
Transmission," IEEE Trans, on Signal Processing, vol. 43, no. II, pp. 2462-2473, Nov. 1995 
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provide one procedure for computing optimal DFE settings. These equations are referred to 
hereinafter as "Al-Dhahir's equations." 

Generally speaking, Al-Dhahir's equations, as well as other existent techniques rely on the 
use of the generalized Schur . algorithm for fast Cholesky decomposition of the matrices involved 
5 in both the FBE and FFE optimal coefficient setting computations. However, the overall 
procedures for calculation of the DFE (FBE and FFE) coefficients have the following problems: 

1. These procedures require the use of nonstructured recursive equations. These equations 
are difBcult to implement from the perspective of mtegrated circuit design. In particular, 
the recursive equations used in Al-Dhahir's DFE tap (coefficient) computer often 

10 requires the use of a DSP processor. As is generally known, the use of a DSP processor 

in a real-time communication system application is severely constricts system 
throughput. 

2. These procedures, in particular Al-Dhahir's equations require a complex tap computer. 
In other words, they require the xise of a relatively large mmiber of complex multiplies. 

15 3. These prior art DFE coefficient computation techniques can be used only for equalizers 
that use a fixed equalizer delay (S), which is set to its maximum value L-l, where L is 
the length of the FFE 104. The prior art techiuques cannot use a dififerent delay S of the 
equalizer. 

Thus, there is a need in the art for a new equalizer coefficient computation methodology 
20 and system in which such methodology may be implemented. 

SUMMARY OF THE INVENTION 
In order to overcome the above-cited shortcomings, among others of fee prior art, the 
method and system of the present invention computes optinud Decision Feedback Equalizer 
25 (DFE) coefficients {g^^jA^p,} from a channel estimate A. As contrasted to the prior art, flie 

methodology of the present invention relies on iterating simple structured recursions to yield the 
DFE coefficients. Resultantly, the operation according to the present invention produces DFE 
coefficients faster and with less computational requirements than the prior art operations, thus 
providing significant advantages over prior techniques. 
30 According to the present invention, a channel impulse response h is first estimated based 

upon either a known training sequence or an unknown sequence. A solution to the DFE 
coefficient computation problem is then cast as a standard recursive least squares (RLS) 

4 
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problem. More specifically, the solution for Feed Forward Equalizer (FFE) coefficients (of 

the DFE) based upon the channel impulse response h is formulated as the Kahnan gain solution 
to the RLS problem. A fast recursive method for computing the Kalman gain is then directly 
used to compute . 

5 In one embodiment of the present mvention, a fast transversal filter (FTP) technique is 

employed to compute g^^. In this embodiment, the complexity of a conventional FTF 

algorithm is reduced to one third of its original complexity by choosing the length of Feed Back 
Equalizer (FBE) coefficients b^^^ (of the DFE) to force the FTF algorithm to use a lower 

triangular matrix. This technique significantly reduces the needed recursions and computations, 
10 as well as avoiding finite precision problems of a conventional FTF solution implemented in 
hardware. 

Finally, with the FFE coefBcients g^ determined, the FBE coefBcients are 

computed by convolving the FFE coefBcients g^ with the channel impulse response h. In 

performing this operation, a convolution matrix that characterizes the channel impulse response 
15 his extended to a bigger circulant matrix. With the extended circulant matrix structure, the 
convolution of the FFE coefficients g^^, with the channel impulse response h may be performed 

the convolution operations in a transformed domain. Li one embodiment, the convolution is 
performed in the firequency domain, which can be computed efficiently using the Fast Fourier 
Transform (FFT). However, in other embodiments, other transformation domams are employed 

20 for the convolution operations, e.g.. Discrete Cosine Transformation domain and the Discrete 
Hadamard Transform domain, among others. 

The present invention may also be applied to equalizers servicing systems havmg 
Multiple Inputs and Multiple Output (MIMO) DFEs. The techniques described above are 
directiy applied to such MIMO DFEs. However, the techniques are modified to support the 

25 requirements of a MIMO digital communication system. 

The method of the present invention is exact and is much faster than any prior technique 
that may have been employed to compute DFE coefficients for the same channel and DFE filter 
lengths. Because the method of the present invention relies upon simple structured reciusion, 
the computation period and/or the hardware required for the DFE coefficient computations is 

30 significantiy reduced as compared to prior art implementations. The reduction in computation 
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complexity and resultant increase in computation speed enables a DFE that operates according 
to the present invention to track fast channel variations. 

The method of the preset invention may be efficiently used ui many communication 
systems that require the use of equalizers such as mobile wireless receivers, fixed wireless 
5 receivers, cable modem receivers, HDTV receivers, etc. to increase the performance of such 
communication systems. Other features and advantages of the present invention will become 
apparent from the following detailed description of the invention made with reference to the 
accompanying drawings. 

10 BRIEF DESCRIPTION OF THE DRAWINGS 

These and other features, aspects and advantages of the present iuvention will be more 
fully understood when considered with respect to the following detailed description, appended 
claims and accompanying drawings wherein: 

FIG. 1 is a block diagram illustrating a discrete time symbol-spaced Decision Feedback 
1 5 Equalizer (DFE) channel equalization model 1 00; 

FIG. 2 is a logic diagram generally illustrating operation according to the present 
invention in determining DFE coefficients and applying such coefficients to a DFE; 

FIG, 3 is a logic diagram illustrating operations according to the present invention 
employed to determine Feed Forward Equalizer (FFE) coefficients for the DFE; 
20 FIG. 4 is a logic diagram illustrating operations according to the present invention 

employed to determine Feed Back Equalizer (FEE) coefficients for the DFE; 

FIG. 5 is a block diagram illustrating a discrete time firactionally-spaced DFE tiiat 
operates according to the present invention; 

FIG. 6 is a block diagram illustrating a multi-channel equivalent of the discrete time 
25 ftactionally-spaced DFE of FIG. 5; 

FIG. 7 is a block diagram illustrating a transceiver cotistructed according to the present 
invention; 

FIG. 8 is a block diagram illustrating a Multi-Input-Multi-Output (MIMO) digital 
conmiunication system that operates according to the present invention to equalize a channel; 
30 FIG. 9 is a system diagram illustrating a wireless digital communication system in which 

a MIMO receiver 904 operates according to the present invention; 
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FIG. 10 is a system diagram illustrating a wireless digital commumcation system that 
includes a plurality of transmitters and a MEMO receiver that operates according to the present 
invention; and 

FIG. 11 is a system diagram illustrating a wired digital communication system that 
5 includes a plurality of transmitters and a MMO receiver that operates according to the present 
invention. 

DETAILED DESCRIPTION 
FIG. 2 is a logic diagram generally illustrating operation according to the present 
invention in determining Decision Feedback Equalizer (DFE) coefficients and in applying such 

10 coefficients to a DFE. The operations of the present invention are performed by a processor, 
such as a Digital Signal Processor (DSP), or other circuitry present within a receiver that 
determines DFE coefficients to be applied to a DFE, also resident in the receiver. The DFE 
operates upon samples of a received signal in an attempt to remove channel effects from the 
samples so that digital data may be extracted from the samples. The structure and operation of 

15 DFEs, one of which was illustrated in FIG. 1, are generally known and will not be further 
described herein except as they relate to the present invention. 

A processor, tap computer, DSP, or other receiver device, to determine initial DFE 
coefficients to be used in subsequent operations by the receiver, will first perform the operations 
of the present invention. Thus, during startup or reset, a channel corresponding to the receiver is 

20 estimated (step 202). According to one embodiment of the present invention, the channel is 
estimated based upon a known preamble sequence. However, in other embodiments, the 
channel could also be estimated based upon unknown received data. In either case, channel 
estimation operations are generally well known and are not described fiirther herein except as it 
relates to the present invention. 

25 With the channel estimated, Feed Forward Equalizer (FEE) coefficients are determined 

based upon the channel estimate (step 206). Then, Feed Back Equalizer (FEE) coefficients are 
determined based upon the FFE coefficients and the channel estimate (step 208). The manner in 
which the FFE and FBE coefficients are generated is step 206 and step 208 will be described in 
detail herein with reference to FIGs. 3-7. 

30 With the FFE and FBE coefficients determined, they are applied to the DFE (step 208) 

and are used m equalizing samples of a received signal to remove chaimel effects. These DFE 
coefficients are continually updated (step 210) using a known technique. Periodically, upon the 
receipt of a next packet for example, upon an indication that a new determination is required, or 
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10 



25 



upon another triggering event, the DFE coefficients are again determined (step 212). In this 
event, another channel estimate may be obtained (step 214). Then, the DFE coefficients are 
again determined according to the present invention and applied to the DFE. The operations of 
FIG. 2 continue until the receiver is turned off, placed in a sleep mode, or otherwise inactivated. 

FIG. 3 is a logic diagram illustrating operations according to the present invention 
employed to determine Feed Forward Equalizer (FFE) coefficients for the DFE. In a first 
operation of FIG. 3, a DFE delay is selected (step 302). In one embodiment, this delay is 
selected as the channel length. In such case, the DFE delay corresponds to the length of the 
FFE. 

Next, the DFE solution is formulated into a least squares solution (step 304). By collecting 
the FFE coefficients g and the FEE coefficients b into a single vector the minimization of the 
Mean Square Error may be written as: 



minE 



Equation (9) 



Now, denoting the variance of the augmented input regression vector u, and cross 
15 variance Rtacn-s) > well-known solution to this smoothing problem is given by Equation (10) 
as: 

y^opi = K'K.in-s) ' Equation (10) 

where 



R, J 



Equation (11) 



20 and 



Using the clumnel output model of Equation (4), and the &ct that x(n) is individually 

identically distributed (i.i.d.), the following closed form expressions for 
\Ry>R^>RiyRM-s)>^^i»-s)\ ^ determined: 

= = (^U + H'H Equation (13) 
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10 



15 



i?^(„.„ = i/-Ex>(«-^) = i/*[0„, 1 or=A- 



S where ^ is a submatrix of H as set forth in Equation (S), 



H 



Equation (14) 

Equation (IS) 

Equation (16) 
Equation (17) 

Equation (18) 



/f,is defined as the (^ + l)xl submatrix of Hy consisting of the first (<y + 1) rows of /f. Note 
that for the case of colored noise, the matrix a]! should be replaced by the autocorrelation 
matrix of the noise . Extending the derivation to the colored noise case is straightforward. 
Now, with the quantities defined above. Equation (10) becomes: 



I 



- -I 


V 




0 



Equation (19) 



Using the well known inverse formula of block matrices, may be rewritten as: 



and as: 



Equation (20) 



Equation (21) 



which may be written as: 

Although each of the two matrices ffj and H^^lm^i shift structure, the augmented matrix 



El 



does not have a shift structure. 
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10 



15 



20 



In selecting the length of the FBE (DFE delay) to be M > i\r - 1 , the quantities , and 



Ho are such that: 



and 



This implies that: 



[tHH] 

h = [h, 0] 



/ + if j + H2H2 = 



Equation (23) 



Equation ^4) - 



Equation (25) 



is block diagonal. In this case, the expressions for gjp, and decouple into a simple form. 
Therefore, the optimal FFE and FBE coefficients are represented as: 

Equation (26) 



opt 



Equation (27) 



The above expressions are valid for all values of the DFE delay 5 . In general, the optimal 
value for the DFE delay 5 is within the range L-\<5^^<1<1 -^L-l, In the special case of the 
choice ^ = £ - 1 , the matrices involved in the above expressions are given by 



A(0) 

AC) 



0 
A(0) 



0 
0 



_A(^-1) /i(iV-2) ••. /j(0) 



h'(N-2) 
h'(0) 



Equation (28) 



and 



0 h(N-l) 
0 0 



h(l) 
h(2) 



Equation (29) 



.0 0 - hiN-l) 
Note that this solution is eqxiivalent to the solution of Al-Dhahir's equations, due to the 
miiqueness of w^p, when Equation (2) is minimized. However, the expressions obtained above 
for Equations (26) and (27) provide alternative methods to compute the FFE and FBE 
coefficients and in a simpler and more efficient way than the prior art techniques. 

In calculating g^ , a coefficient matrix is first defined in Equation (30) as: 
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Pe = («^v ^ + KHsY' Equation (30) 

so that the optimal solution for the FFE coefficients g^^^ is given by 

gopt = PsK ' o Equation (31) 

The expression for g^^ corresponds exactly to the definition of the Kalman gain vector, 
S used to update the optimal weights in a certain regularized Recursive Least Squares (RLS) 
problem. More specifically, given a (w+l)xX data matrix and the correspondmg 

coefficient matrix P„^ the Kalman gain g^=^P„fi^ can be time-updated according to the 
following recursions: 

r-\n) = 1 + h„P„_X > Equation (32) 

10 g„ = P„.iKr(rO . Equation (33) 

Pn = ^n-i - 8nr\ri)g„ , Equatiou (34) 

where P , = cxfl (the term P-i is the initial condition for Pn) and g^ = 0. The computation of 
the Kalman gain vector m the above solution relies on the propagation of the Riccati 
variable P^. This method of computation requires O(i^) operations per iteration, 
15 Well known fast RLS schemes avoid the propagation of P„ and compute the gain g„ in a 

more efficient way. In this case, the computational complexity required is only of 0(L) per 
iteration. This implies that tiie overallxomplexity needed to calculate the FFE coeflBcients is on 
the order of 00) thus yielding an efficient method for computmg the FFE coefficients. 
Therefore, fast RLS filters are used to determine g^^,. In such case, fast transversal 

20 computations are employed to determine the Kalman gain for the RLS solution (step 306). Here 
we also note that it is straightforward to extend this method to use other fast RLS algorithms, 
e.g., array form fast RLS algorithms. 

Faster recursions may be obtained by selecting the FBE length. Such selection eliminates 
certain variables that remain constant and equal to zero during adaptations in the special solution 

25 for gopt • RLS recursion m its explicit form propagates g„ in an efficient manner, see, e.g,: 

[1] Ljung, M. Morf, and D. Falconer, "Fast calculation of gain matrices for recursive estimation 

schemes," Int. J. Contr. vol. 27, pp. 1-19, Jan 1978); [2] G. Carayannis, D. Manolakis, and N. 

Kalouptsidis, "A fast sequential algorithm for least squares filtering and prediction," IEEE 

Trans* on Acoustic, Speech, Signal Proc, vol. ASSP-31, pp. 1394-1402, December 1983; and 

11 



10 



15 



20 



25 



30 
35 
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[3] J. Cioffi and T. Kailath, "Fast recursive-least-squares transversal filters for adaptive 
filtering," IEEE Trans, on Acoust., Speech Signal Processing, vol. ASSP-32, pp. 304-337, April 
1984. 

Table 1 lists the fast recursions employed in one embodiment for computing the 
normalized Kalman gain. The additional index included in Aj,_„ and stems ftom the fact 
that these quantities admit an order-i^ate relation, instead of time-update. 



Initialization 

n(0)=i 



For n = Oto S repeat operations CI) to (13): 

(1) a,in-\) = h{n)-h„_y„., 

(2) f{n-\) = YL{n-mn-\) 

(3) K 



0 




1 









(4) ^^(fi-l) = ^^(«-2)+a*(«-l)/(n-l) 

(5) wi.,^y^„.2^h^J{n-\) 

(6) r..,(«)=ri(«-i)|J|5| 

(7) v(/i) = (last entry of A^^,^., ) 

(9) y5(«) = ^*(«-l)v(«) 

<1°> ^^("> = 1 ,^ 

(11) b{n) = n{n)P,{fl) 

(12) C'(«) = ^*(»-l)+>5*(«)6(") 

(13) >«^=>^t., + M(«) 



Table 1: Fast Transversal Computation of flie Kalman gain. 

The purpose of computing k„ in the well-known FTF algorithm is to use it in the 
computation of the corresponding optimal least squared solution for the FFE coefficients. 
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Because we are interested only in , the filtering part of the FTF algorithm is not necessary, 
and does not appear in the algorithm listing. 

The quantities {m^,w*} are known as the least-squares solutions of the forward and 
backward prediction problems of order £, with corresponding residual errors {/(»), 6(/2)}. Now, 
5 because during the first L iterations, the desired signal for the backward prediction problem is 
equal to zero, the backward least squares solution will be equal to zero. This means that all 
quantities associated with the backward prediction problems will remain nuUs for the first L 
itemtions. Smce in our case the optimal solution is achieved exactly at the Z-tfa iteration, we can 
simply rule out the computation of these quantities firom Table 1. These operations correspond 
10 to operations (7) and (9) through (13) of Table 1. Note that operation (10) of Table 1 is also 
eliminated, since p{ri)-0, which implies //:(«) = ;'£+i(n). This means that we can replace 
operation (6) of Table 1 with: 



15 



Equation (35) 



which can be further simplified since 



^^(^1) ^^(0) C^(n-2) ^ ctI 
C^(0) ^^(1) ^^(«-l) (^^(n-l) 



Equation (36) 



20 Moreover, operation (8) of Table 1 becomes simply 



Equation (37) 



where (1:L) denotes the first L entries of 
25 Table 2 illustrates a simplified fast recursion for computing the optimal FFE coefficients 

Sopt • 



Initialization 
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10 



15 



20 



25 



r(0)=i 



For n = Oto Srepeat operations (1) through (7): 
(1) a,{n-\)^h{ri)-h„.y„_. 





l)a(n-l) 




0 


, /*(«-!) 


1 






- ""-2. 



(4) ^^(«-l) = ^^(«-2) + a*(«-l)/(«-l) 

(5) >nf-i=>»i2+*L,«.i/(«-l) 

(6) K«) = 



Table 2: Fast Transversal Computation of the FFE coefficients. 

With the Kaknan gain determined, the FFE coefScients are th^ determined (step 308). hi 
flie recursions of Table 2, the FFE coefficients are determined by setting = ir^ ^;^(^)when 

the number of iterations, n, is equal to the DFE delay. 

Unlike the conventional weighted FTF algorithm, the above recursions do not face finite 
precision difSculties for the followmg reasons. Fhrst, by ruling out the equations associated with 
the backward prediction problem, we are automatically eliminating many of the recursive loops 
that are responsible for the finite precision difficulties of the full FTF algorithm. Second, these 
simplified fast recursions have a forgetting factor >l = 1 , which yields finite precision stability. 
Third, the simplified algorithm deals with a finite set of data (5 + 1 iterations). This algorithm is 
then reset, which avoids the accumulation of finite precision mors. 

FIG. 4 is a logic diagram illustrating operations according to the present invention 
employed to determine Feed Back Equalizer (FEE) coefficients for the DFE. The FEE 

14 
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coefficients are determined according to the matrix-vector product of Equation (27). The 
computation of the feedback filter coefficients simply amounts to convolution of the channel 
impulse response with g^^, . The convolution operation that defines the optimal FBE 

coefficients in Equation (27) can be computed directly with LMI2 multiplications. 
5 Alternatively, the operations may also be efficiently performed using well-known fast FFT 
convolution techniques. 

To illustrate such operations. Equation (29) is first rewritten in Equation (38) as: 

b^Alu ^l][^o']' Equation (38) 

10 

Hie above equality holds trae regardless of the values of H^, H^, or jffj . Now, if these values 
are chosen such that the matrix C, defined by: 



C = 



H H3 
L^4 H^. 



Equation (39) 



15 



is a square (Qy-Q) circulant matrix, where Q is the smallest power-of-two integer larger than or 
equal M+ L. In this case, the matrix C is rewritten in Equation (40) as: 



C = F*hF, 



Equation (40) 



20 



where F is a (Q^Q) FFT matrix and A is a diagonal matrix that contains &e elements of the 
FFT of the first row of C. The solution for becomes: 



bo^=[ly 0]F*AF j^^J-J , Equation(41) 

25 

The complexity of this method is the complexity of obtaining flie FFT of two vectors of Q 
elements of each, the inverse FFT of another Q element vector, and Q complex multiples. Thus 
the overall complexity is 

15 
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Q + 3fiIog2(jg) . Equation (42) 

For the case of a power-of-two channel estimate the complexity is 

5 

2M + 6MIog2(2M) , Equation (43) 

Thus, referring again to FIG, 4, in determining the FEE coefficients, the convolutional 
matrix H is first determined (step 402). As was previously described, the matrix H may be 

10 determined as part of &e channel estimation. Then, the convolutional matrix ^is extended to 
form the bigger circulant matrix C (step 404). The bigger circulaat matrix C is then converted to 
the Frequency domain (step 406) as are flie FFE coefBdents (step 408). With both the bigger 
circulant matrix C and the FFE coefficients in the frequency domain, they are convolved in the 
frequency domain by simple matrix multiplication (step 410) to produce the FBE coefficients 

15 b^pf . Finally, the resultant FBE coefficients b^^, are then converted to the time domain using 

inverse FFT operations to produce FBE coefficients in the time domain (step 412). The 

FBE coefficients b^^^ are then applied with the FFE coefficients g^^ to the DFE (as previously 

described with reference to FIG, 2. 

FIG. 5 is a block diagram illustratmg a discrete time fractionally-spaced DFE that 

20 operates accordmg to the present invention. FIG. 6 is a block diagram illustrating a multi- 
channel equivalent of the discrete time fractionally-spaced DFE of FIG. 5. The approach used 
for the symbol-spaced model can be easily extended to fractionally spaced models. In this 
section, we shall derive a fast algorithm for a r/2-spaced equalizer. The fast solution for the 
general case of faster over sampling factors, say T/3, T/4, . . etc. will then follow simply by 

25 inspection of the arguments in this section. 

FIGs. 5 and 6 illustrate two equivalent representations for a discrete time model of a T/2- 
spaced equalizer. FIG. 5 illustrates a DFE 500 in terms of a down sampler and an up sampler 
device, while FIG. 6 illustrates a DFE 600 is a corresponding multichaimel representation. 
Thus, as contrasted to FIG. 1, the DFE 500 of FIG. 5 includes a down sampler 502 and an iip 

30 sampler 504 that alter the sampling frequency. Further, the DFE 600 of FIG. 6 includes a first 
channel ftmction 602 having channel esthnate and corresponding noise function Vq(w) and 

16 
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FFE 606 coefficients . Likewise, the DFE 600 of FIG. 6 also includes a second channel 
fimction 604 having channel estimate and corresponding noise function v,(») and FFE 608 
coefficients {go.gi}. 

The equivalence between the stnictures of FIG. 5 and FIG. 6 can be easily verified by 
5 writing each of the quantities {kg} and v(n) in terms of their polyphase components, and 
interchanging their position with the down sampler and the up sampler (see, e.g., P.P. 
Vaidyanathan, "Multirate Systems and Filter Banks," Prentice Hall, NJ, 1993 and JJL Treichler, 
I. Fijalkow, C.R. Johnson, Jr., "Fractionally spaced equalizers," DBEE Signal Processing 
Magazine, vol. 13, no. 3, May 1996 for details). The quantities {/lo A} feo'ft} ^® so- 
10 called polyphase components of the channel and FFEs, and are denoted by N and L size vectors 
{hQ,}\} and {go,gi} (see FIG. 6). The noise sequences {vo(«),v,(«)} are also the even and odd 

samples of v(n), with powers 2a^ . 

A model for the multichannel representation, just like the one for the symbol spaced case 
in equation (4) may be rewritten by collecting {go»&} ^ single vector g' , as: 
15 g' = [g, go] Equation (44) 

with the output of the multichannel system given by: 

I An yi,Ag' Equation (45) 

Thus, the following model of Equation (46) works for the input to g' : 

yi^l=x„[H, JyJ+K„ v,^J Equation (46) 

20 where {^o»^i} ^® convolution matrices associated with the subchannels It is more 

convenient, however, to express the iimer product in Equation (45) as a function of the original 
FFE coefficients vector g, by reordering the entries of g* . In this case, Equation (46) is replaced 
by: 

y„ = x„H' + v'„ Equation (47) 

25 where 
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h{3) 



K2) 



hi?) 



h(0) 
h(2) 



h(2N-l) h(2N-2) 



A(2iV-l) h{2N'2) 



MO) 
K2) 



Equation (48) 



h{2N-\) h{2N-2) 

Given this convolution matrix and the noise variance a^, , the solution for the fractionally space 
problem is simply given by Equations (26) and (27), with {eTj^H^o^ } replaced by {//^,-ff',of } 
(similarly to the symbol-spaced case, here we assume M>N-\), 
5 For the symbol-spaced DFE, the shift structure of implies that = „-i(l : L). 

That is, the normalized gain kj^^ can be computed fest, by performing the order update of kj^^_^ 
to , and then retaining the first L entries of ^^.+1^.1 as . The procedure to derive a fast 
recursion in the 7y2-spaced case follows the same principle. The only difference here is that the 
data matrix H'^ has now a double shift structure, in the sense that each row is formed by shifting 
10 two samples of the chamel at a time. Thus, two successive order updates from to ^^,+2^-1 
are performed and then the first L entries of A:^2^i ^ retained: 



X+Ubj-I 



Equation (49) 



so^t 

= *L.2^i (1 ' ^) Equation (50) 

15 In other words, the resulting algorithm is given by two consecutive forward prediction problems 
similar to operations (1) through (6) of Table 2 of orders L and i + 1, respectively. Table 3 lists 
the resulting algorithm for the 172-spaced case. 
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10 



15 



20 



25 



30 



Initialization 

n(o)=rt*.(0)=i 



For n = Oto S repeat operations (T) to (TT): 
ffl For q = Lto L+1 repeat operations (V) to (€): 

(1) a^(n-r) = hi2n + q-L + l)-K.,w^^, 

(2) /,(«-!) = y,(«-l)a,(n-l) 



0 




-1) 


1 






-2) 





(4) C/(n-l) = ^/(«-2) + a;(n-l)/,(/2-l) 

(5) m/-, =<^-2+ Vi/^^^-^) 



Table 3: Fast Transversal Computation of FFE coefGcients for the 7j^-spaced equalizer. 

In the general case of a T/S spaced equalizer, the fast DFE tap computation is a 
straightforward extension of the above algorithm, where g is iterated from L to (L+S-l). 
However, in the fractionally spaced equalizer, the RLS problem is formulated as a multi-channel 
problem. In such case, a multi-channel Kalman gain is calculated for the multi-channel RLS 
problem and the FFE taps are determined there from. Note that hi flie expressions of Table 3, 
successive order updates are performed. 

Now, the optimal FBE taps could now be computed as 



Equation (51) 



where is a block diagonal matrix, which satisfies 

Cs=Fs* AsFs Equation (52) 

where = F ® andQ is a block circulant matrix, which is formed by extending the matrix 

if in the same manner as in Equation (39). As was the case with the symbol spaced operations 
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described above, the convolution may be performed in a transformed domain, e.g,, Frequency 
Transformation domain. Discrete Cosine Transformation domain and the Discrete Hadamard 
Transformation domain, among others. In such case a Croniker product may be employed in 
conjunction with such domain transformation in performing multi-channel convolution in the 
5 selected transformed domain. 

FIG. 7 is a block diagram illustrating a transceiver constructed according to the present 
mvention. The components of the transceiver 700 are resident in a communications device and 
are illustrated only generally to show how the operations of the present invention would be 
accomplished in such a transceiver. The transceiver 700 includes a receiver section 702 and a 

10 transmitter section 704 and further includes either a wireless transceiver 706 or a wired 
transceiver, depending upon the system in which the transceiver 700 is implemented. In the case 
of a cellular device, RF device, satellite system device, or another wireless unplementation, the 
transceiver 700 includes a wireless transceiver. However, in the case of a cable modem, a LAN 
device, a home networking device, or another device that couples to a physical media, the 

15 transceiver 700 includes a wired transceiver 708. Further, if the present invention is 
implemented in a Serializer/Deserializer (SERDES) or similar application, the receiver section 
702 and transmitter section 704 may couple to a media without a wured transceiver 708. 

Further discussion of the transmitter section 704 and receiver section 702 are in the 
context of baseband processing. In such case, the receiver section 702 receives a baseband 

20 signal from the wireless transceiver 706 (or wired transceiver 708) that is baseband modulated 
and operates upon the baseband signal to extract data. These operations include determining 
DFE coefficients according to the present invention and operating upon the baseband signal 
using the determined DFE coefficients. 

The transmitter section 704 receives digital data to be transmitted from a host, codes the 

25 digital data into a baseband signal, and passes the baseband signal to the RF transceiver 706. 
The RF transceiver 706 couples the baseband signal to an RF carrier to create an RF signal and 
transmits the RF signal to a receiving device across a wireless link. 

The receiver section 702 receives a baseband signal that carries coded data from the RF 
transceiver 706. A Programmable Gain Amplifier (PGA) 712 adjusts the gam of the baseband 

30 signal and then provides the gam-adjusted baseband signal to an Analog-to-Digital Converter 
(ADC) 714 for sampling. The ADC 208 samples the gam adjusted baseband signal at a 
particular sampling frequency, fj (that is the symbol clock firequency), to produce samples 
thereof. 
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A processor 710 couples to the output of the ADC 714 and analyzes a preamble sequence 
contained in each received physical layer frame. Based upon the preamble sequence, the 
processor 710 determines a gain to be applied to portions of the baseband signal corresponding 
to the data carrying portions of the frame and provides this gain to the PGA 712. Further, the 
5 processor 710 may also interact with the optional timing compensation section 716 to 
compensate for symbol timing and RF carrier mismatches. 

The processor 710, based upon the preamble sequence (and based upon actual extracted 
data in some operations), also determines FFE 104 and FBE 108 coefficients. The manner ui 
which these coefficients are determined was previously described in detail herein. Further, and 
10 as was also previously described, the processor 710 may estimate a channel and calculate DFE 
coefficients based upon unknown but assumed data content. After the processor 710 determines 
these coefficients, they are applied to the FFE 104 and FBE 108 for subsequent use in extracting 
data from the baseband signal. 

The structure described in FIG. 7 may be embodied using various types of circuits 
1 5 formed using various manufacturing processes. For example, m one particular embodiment, the 
RF transceiver 706 (or wired transceiver 708) is embodied in a first integrated circuit that is 
coupled to a second integrated circuit that includes the transmitter section 704 and the receiver 
section 702, among other circuits. In another embodiment, the RF transceiver 706, the 
transmitter section 704 and the receiver section 702 are all formed on a single monolithic 
20 integrated circuit. These integrated circuits may be constructed in CMOS or another 
semiconductor technology, e.g., PMOS, NMOS, Bipolar, etc. 

Further, the receiver section 702 of FIG. 7 may be constructed usmg various circuit 
elements/combinations. In one embodiment, all structures past the ADC 714 in the receiver 
section 702 are embodied using a Digital Signal Processor (DSP) or similar processing device. 
25 In another embodiment, dedicated signal path circuitry embodies each of the structural 
components of the receiver section 702, including the processor 710. While a DSP 
implementation would provide more flexibiUty, dedicated signal path circuitry would ^ically 
provide higher performance at a lower cost and with lower power consumption. 

The structure and operation of the present invention may be employed in satellite 
30 communication systems, satellite television systems, HDTV systems, fixed wireless 
communication systems, mobile wireless communication systems, cable modem/television 
communication systems, home networking systems, wireless local area networking systems, 
wired local area networking systems, and many other types of commimication systems. The 
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present invention applies to all types of conununication devices in which equalizers are 
employed to operate upon received signals. 

FIG. 8 is a block diagram illustrating a Multi-Input-Multi-Output (MIMO) digital 
communication system that operates according to the present invention to equalize a chaimel. 
5 MIMO digital communication systems, as the term unplies, are communication systeins that 
include multiple inputs on a transmitting side and multiple outputs on a receiving side. In such 
systems, MIMO decision feedback equalization is used to mitigate inter-symbol interference 
(ISI) that results from chaimel multi path propagation. 

In the embodiment of FIG. 8, an input symbol stream includes P unique transmitted 

10 signals, represented by XqCh) , x^Cn), . . Xp_^in) (as are further shown with reference to FIGs. 
8, 9, and 10). The nomenclature of FIG. 8 is used herein to describe the method of the present 
invention. Thus, the P transmitted signals are referred to in combination as transmitted signal 
vector x(h) . The transmitted signal vector jc(«) consists of a known training sequence followed 
by unknown data. The transmitted signal vector x(n) passes through a channel 802 represented 

15 by H{z) that is represented to have N taps. The channel 802 includes additive noise v(«), 
which is white and Gaussian and has power spectral density . The output of tiie chaimel 804 
is referred to as y(n) and has a length Q (the value Q is a function of P and N). 

A MIMO DFE that equalizes the channel includes a MIMO FFE 804 having function 
G(z) and length L and a MIMO FBE 808 having function B(z) and length M (that is assumed to 

20 be greater or equal to the channel memory, i.e., M>N-l). The outputs of both the MIMO 
FFE 804 and the MIMO FBE 808 have a width P that corresponds to the multiple numbers of 
signals. MIMO decision block 806 makes soft decisions for the MIMO symbol stream and 
produces as P output symbol streams Xo(/7-^)> ^^(.^-S) ^ . . Xp^^in-S). These soft 
decisions are provided to further receiver components, e.g., a Viterbi decoder to produce hard 

25 decisions based upon the soft decisions. 

Examples of systems in which the MIMO DFE of the present invention may be 
implemented include wireless systems, e.g., cellular wireless systems, fixed local loop wireless 
systems, wireless local area networks, high definition television systems, etc., and wired systems 
in which multiple transmitters and receives share a common media, e.g., cable modem systems, 

30 etc. Diagrams representing such systems are illustrated in FIGs. 9, 10, and 11. The stmcture of 
FIG. 7 may be employed to implement the structure and operations described with reference to 
FIGs. 8-11. 
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FIG. 9 is a system diagram illustrating a wireless digital commumcation system 900 in 
which a MIMO receiver 904 operates according to the present invention. The wireless system 
900 includes a MIMO transmitter 902 that operates to receive a plurality of signal streams and to 
form a wireless signal and that transmits the wireless signal. The MIMO receiver 904 receives 
5 the wireless signal after a correspondmg channel has operated upon it. The MIMO receiver 904 
includes a MIMO DFE that operates according to the present invention to equalize the received 
signal and to produce output based thereupon. As is illustrated, the MIMO receiver 904 may 
include antenna diversity structure. 

FIG. 10 is a system diagram illustrating a wkeless digital communication system 1000 

10 that mcludes a plurality of transmitters 1002A-1002G and a MIMO receiver 1004 that operates 
according to the present invention. Each of the plurality of transmitters 1002A-1002G produces 
a respective transmitted signal that carries a respective input symbol stream, xi(n), X2(n), . . 
xp(n). The MIMO receiver 1004 receives the plurality of transmitted signals as a composite 
signal after they have been operated upon the channel, equalizes the plurality of received signals, 

15 and produces soft decisions for the equalized symbols. The wireless digital communication 
system 1000 of FIG. 10 may be embodied as a cellular wireless network, a fixed wireless access 
network, a wireless local area network, or in another type of wireless communication system in 
which a MIMO receiver 1004 receives wireless transmissions from a plurality of wireless 
transmitters 1002A-1002G. As is illustrated, the MIMO receiver 904 may include antenna 

20 diversity stmcture. 

FIG. 1 1 is a system diagram illustrating a wired digital communication system 1 100 that 
includes a plurality of transmitters 1102A-1102G and a MIMO receiver 1104 that opemtes 
according to the present invention. Each of the plurality of transmitters 1 102A-1 102G produces 
a respective transmitted signal that carries a respective input symbol stream, xo(n), xi(n), . . ., xp. 

25 i(n) and couples its respective signal to a network infrastructure 1106. The MIMO receiver 
1 104 couples to the network infrastructure 1 106 and receives the plurality of transmitted signals 
as a composite signal after they have been operated upon the channel, equalizes the plurality of 
received signals, and produces soft decisions for the equalized symbols. The wired digital 
commxmication system 1000 of FIG. 11 may be embodied in a cable modem system, or in 

30 another type of system in which the MIMO receiver 1004 receives wired transmissions from a 
plurality of transmitters 1002A-1002G 

Referring again to FIG. 8, an efficient equalization technique for equalizing a channel in 
a MIMO system includes first estimating the chaimel unpulse responses between each 
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transmitter and each receiver using the trahiing sequence, and then using this estimate to 
compute the optimal DFE tap coefficients corresponding to the estimated channel. The 
computed tap coefficients are then uploaded to the equalizer taps of the MIMO FFE (including 
MEMO FFE 804 and MIMO FEE 808) of FIG. 8. This procedure should be repeated as often as 
5 possible, especially in cases of fast varying channels. Moreover, the received data stream is 
usually buffered during the period assigned for the channel estimation and the equalizer tap 
computation. 

In this context, a critical factor for the success of this equalization structure is the 
complexity of the MIMO DFE tap computation. A fast computation technique has the foUovsing 
10 benefits; 

1 . It reduces the memory size required to buffer the received sequence during the period 
required for tap computations. 

2. It allows more firequent uploading of new equalizer coefficients, thus enabling the 
equalizer to track fast channel variations. 

15 3. It simplifies the needed hardware, especially if such computations are performed 

through structured recursive equations. 
According to the present invention, an efficient method for computing the optimal 
MIMO MMSE-DFE coefficients includes characterizing a solution to the problem of solving for 
MIMO FEE and FFE coefficients as a fast recursive-least-squares (RLS) adaptive algorithm. 
20 This fast algorithm has the following advantages over the current most efficient algorithm: (1) 
the proposed algorithm has less overall computational complexity than prior techniques in 
solving for DFEs having the same channel and DFE filters length; and (2) the technique relies 
on the use of a set of structured recursions, which makes it attractive in data-path 
implementations which do not rely on the use of a DSP core. 
25 For simplicity of presentation a symbol-spaced MIMO DFE is considered. Extension of 

the teachings provided herein to the fi:actionally-spaced case is straightforward. Further, the 
following commonly used assumptions are made herein: 

1. The input sequences to the i-th channel, {x^(«)}, are complex, independent and 

identically distributed with power cr^ , . The autocorrelation matrix of the corresponding vector 
30 process x(«) is given by: 

= I® cr/ag(a^,o,<,,...,<,,i) Equation (53) 
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where ® denotes the Kronecker product, and / is identity matrix of ^propriate 
dimensions. 

The noise sequences {v,(n)} are assumed to be white Gaussian with power alj. Tbs 
autocorrelation matrix of size LQ of &e correspondmg vector noise process v(r) is ^ven by: 
5 R» = /i®cr/asr(a^,o'<i/"v<c-i) Equation (54) 

The decisions (from decision block 806) x(n-«5) are assumed to be correct, and hence 
equal to x(n - S) . Moreover, the assumption that only previous decisions of other users are 
available at the present time, i.e., tune (n-S). It is^ further assumed that the use of current 
decisions of other users contributes with negligible improvement of the final output SNR. 
10 The MMO FFE 804 has a matrix filter G(z) with length L. The number of matrix taps 

Mof the MIMO FBE 808 having matrix filter B(z) is such that M > i\r - 1 . 

The goal in determining Ihe taps of the MIMO FFE 804 and the MIMO FBE 808 is to 
minimize the mean squared error quantity: 

^ = E||x(«-<y)-x(«-5)f , Equation(55) 
15 where x(n-S) is the delayed input signal estimate prior to the decision. By collecting the t:^ 
matrix coefficients both G(z) and B(z) into vectors, the received signal i(« - ^) is expressed as: 
jL{n-S) = y„g-x„h, Equation (56) 

where: 

y„=x,H+v„ Equation (57) 

20 H is the {N+L-l)PxLQ convolution matrix associated with the chaimel matrix 

coefficients h(0), h(l),..., b{N- 1), and is given by: 



H = 



h(0) 
h(l) 



0 
h(0) 



h(iV-l) h(N-2) 
0 h(iV-l) 



0 
0 

h(0) 
h(l) 



Equation (58) 



0 0 ••• h(iV-l) 

The row vector x„ is the 1 x (iNT + i - l)P input regressor: 

x,^[x(«) x(n-l) - xin-N-L+2)] 



Equation (59) 
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With x{n) = [xq (n) («) • • • Xp^^ (w)]. The row veptor y „ is the 1 x input regressor to 

tiie feed forward filter g, i.e., 

y„^|y(w) y(«-l) y(«-i+l)] Equation (60) 

g « col [go ,gi gjt^ ] Equation (61) 

5 Where y(n) = [yoW yi(P) J'e-iWj (g/) has si2» gxP. Similarly, x„ is the 

1 X MP input regressor to the (strictly causal) feedback filter b, i.e., 

i„t[x(n-S'l) xin-S-M)] Equation(62) 

b 4 col[bo.b„...,b^.i] Equation (63) 
Where {bj are PxP. Also, v„ is the IxLQ noise vector process. 

By collecting g and b into a single matrix w, the minimization of Equation (55) can be 
written as: 



10 



20 



minE 



x(«-^)- jy„-xJ 



Equation (64) 



Now, denoting R„ as the variance of the augmented input regression vector u, and 



as the cross variance of the augmented mput regression vector u, a solution to this 



15 smoothing problem may be determined usmg a known technique, see, e.g., T. Kailath, A.H. 
Sayed, and B. Hassibi, Linear Estimation, Prentice Hall, N.J., 2000, to produce an optimal 
solution for w according to: 

uxin-s) y Equation (65) 



where: 



and 



-R 



-R 
R, 



yi 



Equation (66) 



Equation (67) 



Using a channel ouQJut model, such as the channel output model described in T. Kailath, 
A.H. Sayed, and B. Hassibi, linear Estimation. Prentice Hall, NJ., 2000, and tiie feet that 
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{x,(n)} are individually identically distributed (i.i.d.). The following closed form expressions for 
{r, .I^j ,Rs,Ry,{^^ ,Rs^^^ j may be formed: 
R,=Ey:y.=R,+H-R,H, 
0,. 



R^=H'(ExX) = »r 



Rj -R, 



where H is a submatrix of H, such that 
"H. 



R<;> 

0 



A h'Rtf' 



H = 



H 
H 



2 J 



.where H, is (S+l)PxLQ 



Equation (68) 
Equation (69) 
Equation (70) 
Equation (71) 
Equation (72) 

Equation (73) 



Now, with the quantities defined above, Equation (65) becomes: 



0 



Equation (74) 



Using the well-known inverse of block matrices. Equation (74) may be rewritten as: 

w^pt = ^ (Rv + K^^^% + H^Rif JH2)-*h*Rif ' • Equation (75) 

Further, because M > - 1 , the quantities , Hj and Hj are such that: 



Equation (76) 



This implies the following expressions for the optimal feedback and feed forward coefficients 
may be rewritten as: 

' Equation (77) 

Equation (78) 



gopt=(R.^H:Rif>H,)"^h:Rir> 



20 where R^*^ and R^**^ are size S and P matrices. The above e>q)ressions are valid for all values of 
the delay & In general, the optimal value for the decision delay S is within the range 
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In finding a solution for g^p^ , a coefficient matrix is first determined as: 

P. = (Rv + S; H^)-' , Equation (79) 

Where the definition ^ R^'^H, is made. The optimal solution for the feed forward 
coefficients is then given by: 
5 = P^h^Ri'^ . Equation (80) 

A reader acquainted with the theory of recursive least squares algorithms might readily 
recognize that the quantity g^p^ = Pjh^ corresponds exactly the definition of the Kahnan gain 

matrix in the recursive least squares algorithm. The Kalman gain matrix is used to update the 
optimal weights in a regularized (multi channel) RLS problem. More specifically, given a 
10 (« + l)Pxijgdata matrix H„ and the corresponding coefBciait matrix P„, the Kalman gain 

Sn = ^nhl can be time-updated according to the following recursions (see e.g. T. Kailath, A.H. 
Sayed, and B. Hassibi, Linear Estimation, Prentice Hall, N.J., 2000): 

r\n) = lj. +h„P«.ihI , Equation (81) 

I« = . Equation (82) 

15 P„ =P„-, -gj^^'m: > Equation(83) 

where P., = R;* and = 0. 

The computation of g„ can be done efficiently, via a fast RLS recursion. That is, 
consider Equation (82), which can be written as: 

g« =Kr(n), Equation (84) 

20 where y(n) is defined in Equation (81). The quantity k„ - P„.i\C is referred to as the 
normalized Kalman gain matrix in adaptive RLS filtering. The key to achieving fiast recursions 
is to propagate k„ efficiently, by successive order updates and order down dates of k„ , using 

forward and backward LS prediction problems. The well-known fast transversal filter (FTF) is 
an example where such fast technique is encountered, see e.g., J. CiofB and T, Kailath, *Tast 
25 recursive-least-squares transversal filters for adaptive filtering," IEEE Trans, on Acoust, Speech 
Signal Processing, vol. ASSP-32, pp. 304-337, April 1984. 

The difference here, however, is that the entries of the regressor h„ are the channel 
matrix coefficients h(0),.,.,h(iV-l) , each of size PxQ QnUbe multi channel RLS equations, 
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the entries of the regressor are usually given by a row or a column vector, and the recursions 
have no matrix inversions). 

Now let =k„, and denote {k,„} the augmented Kalman gains that correspond to 

estimating each column of h(n) , say, [ h(n)] from [ h(n)] . h„., . In order to update k^^ 
5 efificiently without matrix inversions, we perform Q successive order iq)dates from k^^j to 
kg^_, . Now, note that in general the fast recursions that computer k^^ rely on forward and 
backward prediction problems. Here, because M'^N-l the matrix has a block lower 
triangular structure, the desired signal for the backward prediction problem is always equal to 
zero. As a result, the least-squares backward prediction solution up to time S will be equal to 
10 zero, and the fast algorithm will need only the forward prediction part. That is, 

K^i kj^i -> kg^., Equation (85) 

so that 

ko. = ke^-i 0 : GAO Equation (86) 

The resulting algorithm is then given by consecutive updates of Q forward prediction 
15 problems where we estimate the columns of h(«) from h„_i and compute the corresponding 
forward prediction matrix-valued quantities. Table 4 lists the resulting multi channel algorithm. 
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Initialization 

For (f ^OtoO-l perfonn the following initialiVari on steps: 
ro(0) = I, 



For n = OtoS. repeat overations (T). fIR. anddin : 
(T) Fora = \ to O. repeat (I) to (S) : 

(1) «r,(«-l) = [h(«)l,g.,-h^.w^^, 

(2) f,(«-l) = r,(^-l)«,(«-l) 



cc'An-l) 



(4) <-/(n-l) = C/(/i-2) + a;(«-l)f,(«-l) 

(5) y^i^-t =w^^2 +k,.,^.,f,(«-l) 

(6) M«-i)=r^.(«-i). ^-^"-f-'^"-^^ 

(H) ko„=ke^.i(l:eA:) 

(ni)ro(«)=rg('j-i) 



Finally, set g^^\,roim'^' 



Table 4: Fast Computation of for the MIMO DFE. 

The exponentially weighted FTF algorithm can become unstable in some finite precision 
implementations. Hence, the reader may wonder wheth^ this is also the case for the present 
simplified algorithm, since both have flie same essence. There are two advantages, however, 
30 that prevent the simplified algorithm from becoming unstable: 

(i) The main source of enor propagation in fte full FTF algorithm arises in the 
backward prediction section of its recursions. Here, by rulmg out the equations 
associated with the backward prediction problem many of the recursive loops that 
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contribute to the unstable behavior of the fiill FTF algorithm are automatically 
eliminated. 

(ii) Another source of instability of the FTF recursions is related to the forgetting factor 
X that appears in the exponentially weighted RLS problem. Theoretically with A, < 1, the 
redundant components generated by numerical errors would decay to zero as ^ -> oo . 
However, an averaging analysis shows that this will lead to unstable modes at VK that 
will cause instability in finite precision. Now, note that the present fast recursions deal 
with the problem of filtering a finite set of data samples of the channel model. In other 
words, for the purposes of the present invention, the algorithm must stop when n-S. 
Moreover, in the present application, the corresponding forgetting factor is always equal 
to one, in which case the recursions will present much better numerical behavior. This 
means that even if the simplified fast algorithm were to become unstable, it would not be 
likely to happen within the first S iterations. Furthermore, if this were still the case, a 
simple increase of the word length would overcome the problem. 
The computation of b^pj m equation (36) can be performed efficiently via using fast 
Fourier transform techniques. This can be done by extendmg the block Toeplitz structure of H 
to form a KPxKQ block-circulant matrix Cpg, where each block entry has size Pxg (see, 
e.g., M. Volhner, J. Gotze, M Haardt, "Efficient Jomt Detection Techniques for TD-CDMA in 
the Frequency Domam," COST 262 Workshop "Multiuser Detection in Spread Spectrum 
Communications" y Ulm, Germany, 2001). 

In the present case, there is a block diagonal matrix A^ g such that Cp ^ = F^a^ ^Fq where 
F, are block-Fourier transfonns given by = F® with 0 denotmg the Kronecker product 
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and the matrix Fiho KxK DFT matrix. The diagonal elements of g can thus be computed 
from the block-DFT of the first block-column of C^ q . The b^^ then becomes 



Hence, the complexity is given b P inverse FFTs, Q FFTs, and P FFTs of size Jf, and KPQ 
5 complex multiplies. Thus the overall complexity for computing the feedback filter is 
/CPQ + {2P + Q)Wog2(/C). 

The invention disclosed herein is susceptible to various modifications and alternative 
forms. Specific embodiments therefore have been shown by way of example in the drawings 
and detailed description. It should be understood, however, that the drawings and detailed 
1 0 description thereto are not intended to limit the invention to the particular form disclosed, but on 
the contrary, the invention is to cover other modifications, equivalents and altematives as 
defined by the claims. 
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CLAIMS: 

1. A method for computing Multi-Input-Multi-Output (MIMO) Decision Feedback 
Equalizer pFE) coefficients, the method CHARACTERIZED BY: 

estimating the channel response of a channel operated upon by the DFE; 

formulating a solution that, when solved, will yield the MIMO DFE coefficients, wherein 
the solution is formulated as a least squares problem that is based upon the channel response; 

solving the least squares problem to yield MIMO Feed Forward Equalizer (FFE) 
coefficients of Ae MIMO DFE coefficients; 

convolving the MIMO FFE coefficients with a convolution matrix that is based upon the 
channel response to yield MIMO Feed Back Equalizer (FEE) coefficients of the MIMO DFE 
coefficients. 

2. A method for computing fractionally spaced Multi-Input-Multi-Output (MIMO) 
Decision Feedback Equalizer (DFE) coefficients, the method CHARACTERIZED BY: 

estimating the channel response of a channel operated upon by the MIMO DFE; 

formulating a multi-channel solution that, when solved, will yield the fractionally spaced 
MIMO DFE coefficients, wherein the multi-channel solution is formulated as a multi-channel 
least squares problem that is based upon the channel response; 

solvmg the multi-channel least squares problem to yield fractionally spaced MIMO Feed 
Forward Equalizer (FFE) coefficients of the MIMO DFE coefficients; 

convolvmg the fractionaUy spaced MIMO FFE coefficients with a multi-channel 
convolution matrix that is based upon the chaimel response to yield MIMO Feed Back Equalizer 
(FBE) coefficients of the MIMO DFE coefficients. 

3. The method of claim 1 or 2, FURTHER CHARACTERIZED BY formulating the 
recursive least squares solution as a Kahnan gain solution. 

4. The method of claim 3, FURTHER CHARACTERIZED IN THAT the Kalman 
gam solution is determined using a Fast Transversal Filter (FTF) algorithm. 

5. The method of claim 4, FURTHER CHARACTERIZED IN THAT a length of 
the MIMO FBE is chosen to force the FTF algorhhm to use a lower triangular matrix. 
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6. The method of claim 3, FURTHER CHARACTERIZED IN THAT the Kahnan 
gain solution is determined using an Array Fonn Algorithm. 



7. The method of claim 6, FURTHER CHARACTERIZED IN THAT a length of 
5 the MIMO FEE is chosen to force the Array Form Algorithm to use a lower triangular matrix, 

8. The method of claim 1 or 2, FURTHER CHARACTERIZED IN THAT in 
convolving the MIMO FEE coefficients with the convolution matrix that is based upon the 
channel response to yield the MIMO FBE coefficients of the MIMO DFE coefficients, the 

1 0 method further includes: 

extending a convolution matrix created based upon the channel response to a bigger 
circulant matrix; and 

performing the convolution in a transformed domain. 

15 9. The method of claim 8, FURTHER CHARACTERIZED IN THAT the 

transformed domain is selected from the group consisting of the frequency domain, the Discrete 
Cosine Transform domain and the Discrete Hadamard Transform domain. 

10. The method of claim 1 or 2, FURTHER CHARACTERIZED IN THAT in 
20 convolving the MIMO FFE coefficients with the convolution matrix that is based upon the 

channel response to yield the MMO FBE coefficients of the MIMO DFE coefficients, the 
method further includes: 

extending a convolution matrix created based upon the channel response to a bigger 
circulant matrix; and 
25 computing the convolution m the frequency domain. 

11. The method of claim 10, FURTHER CHARACTERIZED IN THAT computing 
the convolution m the frequency domain includes: 

transforming the convolution matrix and the MIMO FFE coeflBcients from the time 
30 domain to the frequency domain using a Fast Fourier Transform; 

computing the convolution in the frequency domain to produce the MIMO FBE 
coefficients; and 
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transforming the MIMO FBE coefficients from the frequency domain to the time 
domain. 

12. The method of claim 1 or 2, FURTHER CflARACTERIZED IN THAT the 
5 channel response is based upon a known preamble sequence of a packet upon which the MIMO 

DFE operates. 

13. The method of clahn 1 or 2, FURTHER CHARACTERIZED IN THAT the 
channel response is based upon decisions made for data carried in a packet. 



10 



15 



14. The method of clahn 1 or 2, FURTHER CHARACTERIZED IN THAT the 
channel response is based upon: 

a known training sequence contained in a packet preamble; and 
decisions made for data carried in the packet 

15. The method of claun 1 or 2, FURTHER CHARACTERIZED IN THAT the 
method is applied within a Digital Television Broadcast system using diversity receivers and/or 
multiple receive antennas, includmg VSB and QAM HDTV systems. 
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